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DETAILED ACTION 

1. Claims 1-31 are pending in this application. 

Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 



3. Claims 1-5, 7, 16-20, 22, 24-28, 30 are rejected under 35 USC 103(a) as being 
obvious over . Bergamaschi et al. (Bergamaschi et al., Semantic Integration of 
Semistructured and Structured Data Sources, SIGMOD Record, Vol. 28, No. 1, March 
1999, pp. 54-59) (hereinafter Bergamaschi), in view of Squire (Squire, Cass, Data 
Extraction and Transformation for the Data Warehouse, ACM, SIGMOD, 1995, pp. 446- 
447.), 



4. Regarding claim 1, Bergamaschi discloses a computer-implemented method for 
mapping a schema to a disparate schema, comprising: 

matching columns of a data schema to corresponding columns of another data 
schema to provide a mapping (page 56, column 1 , paragraph 2); and 
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updating a matching resource based on the mapping (Section 1.1, Generation of 
a Common Thesaums, paragraph 1 ; a Common Thesaurus as disclosed by 
Bergamaschi). 

Bergamaschi does not explicitly disclose determining whether data within 
matching columns of the user data schema has a data type different than data within 
the corresponding columns of the mining model schema; and transforming the data 
within the matching columns of the user data schema if the data type is determined to 
be different; and also does not explicitly disclose wherein if the data types of matching 
columns are different data from the first column is transformed. 

In the same field of endeavor (integration of disparate data) Squire discloses 
transforming data from a user data environment to a data warehouse environment 
(page 447, column 1, paragraph 2). Determining whether a transformation needs to be 
done before performing a transformation was well known in the art at the time the 
invention was made. 

Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Squire's teachings of 
transformation of data in a user data schema to data in a data warehouse schema with 
Bergamaschi's method for matching columns of a data schema to corresponding 
columns of another schema and updating a matching resource to obtain the claimed 
invention. Squire suggests in the last paragraph of column 1, page 447, that automated 
tools can reduce time and cost of implementation of a data warehousing system, and 
further suggests in the same paragraph a combination with automated tools that 
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"actively capture and maintain meta data related to source files... <and> 
transformations and mappings". Bergamaschi suggests a need in paragraph 1 of page 
54 to make integration as semi-automated and scalable as possible. 

Regarding claim 2, Bergamaschi and Squire do not explicitly disclose providing an 
opportunity to manually alter the mapping after transforming the data; and presenting a 
final view of the mapping after providing the opportunity, wherein the updating step is 
performed after the final view is presented. 

In the same field of endeavor (semantic discovery and mapping between data 
sources), Gorelik discloses providing an opportunity to manually alter the mapping after 
transforming the data, and presenting a final view of the mapping after providing the 
opportunity, wherein the updating step is performed after the view is presented (Figure 
7; paragraphs 0530 and 0531; see also paragraph 0538). 

Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Gorelik's teachings of providing 
an opportunity to manually alter the mapping after transforming the data, presenting a 
final view and updating, with Bergamaschi's and Squire's suggested teachings of a 
computer implemented method of mapping a user data schema to a mining model 
schema to obtain a system wherein an opportunity is provided to change a mapping 
between a user data schema and a data mining schema. Gorelik suggests the need for 
an improved automated way to discover mappings in paragraph 0002. Bergamaschi 
states in section 1.1, Generation of a Common Thesaurus, paragraph 2, that the 
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process of discovering mappings is a semi-automatic process wherein a designer is 
aided by automated tools. 

Regarding claim 3, wherein the matching step comprises determining whether names of 
the columns of the user data schema exactly match names of the columns of the mining 
model data schema (Subsection Automated extraction of relationships, paragraph 2; 
see also Figure 4; Bergamaschi discloses finding synonyms, and shows in figure 4, 
where a synonym is inclusive of an exact match of a column name: "ID. Patient SYN 
CD.Patienf ). 

Regarding claim 4, wherein the matching step further comprises determining whether 
the names of the columns of the user data schema are similar to the names of the 
columns of the mining model data schema based on the matching resource (Subsection 
Automated extraction of relationships, paragraph 2). 

Regarding claim 5, wherein the matching step comprises determining whether the 
names of the columns of the user data schema match the names of the columns of the 
mining model schema based on one or more formulae (Section 1.1, Generation of a 
Common Thesaurus, paragraph 1; Bergamaschi discloses matching column names that 
are not exact or synonyms, but rather represent the same type of data in the disclosure 
of broad and narrow term relationships, which is the equivalent of formula based 
matching). 
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Regarding claim 7, wherein the matching resource is selected from the group consisting 
of a thesaurus, a dictionary and a similarity threshold (Section 1.1, Generation of a 
Common Thesaurus, paragraph 1). 

5. Claims 6, 8, 9-15, 21, 23, 29, and 31 are rejected under 35 USC 103(a) as being 
obvious over . Bergamaschi et al. (Bergamaschi et al., Semantic Integration of 
Semistructured and Structured Data Sources, SIGMOD Record, Vol. 28, No. 1 , March 
1999, pp. 54-59) (hereinafter Bergamaschi), in view of Squire (Squire, Cass, Data 
Extraction and Transformation for the Data Warehouse, ACM, SIGMOD, 1995, pp. 446- 
447.), in further view of (U.S. Patent Publication #: 2005/0055369). 

Regarding claim 6, Bergamaschi and Squire do not explicitly disclose wherein the 
matching step further comprises determining whether the data within the columns of the 
user data schema corresponds to the data within the columns of the mining model data 
schema. 

In the same field of endeavor (semantic discovery and mapping between data 
sources), Gorelik discloses wherein the matching step further comprises determining 
whether the data within the columns of the user data schema corresponds to the data 
within the columns of the mining model data schema (paragraph 0341 ; see also 
paragraphs 0346-0349). 
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Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Gorelik's teachings of 
determining whether the data within the columns of the user data schema corresponds 
to the data within the columns of the mining model data schema, with Bergamaschi's 
and Squire's suggested teachings of a computer implemented method of mapping a 
user data schema to a mining model schema to obtain a system wherein an opportunity 
is provided to change a mapping between a user data schema and a data mining 
schema. Gorelik suggests the need for an improved automated way to discover 
mappings in paragraph 0002. Bergamaschi suggests in paragraph 1 of page 54 a need 
to determine if disparate sources contain semantically related information. 

Regarding claim 8, Bergamaschi and Squire do not explicitly disclose populating a 
schema consolidation table with names of the columns of the mining model schema, 
prior to the matching step; and updating the schema consolidation table with names of 
the matching columns of the user data schema, during the updating step. 

In the same field of endeavor (semantic discovery and mapping between data 
sources), Gorelik teaches the use of a match table (the equivalent of a schema 
consolidation table) indicating mappings between columns (paragraphs 0091 and 
0095). 

Accordingly, it would have been obvious to a person of ordinary skill in the art at the 
time the invention was made to have incorporated Gorelik's teachings of a schema 
consolidation table, with Bergamaschi's and Squire's suggested teachings of a 
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computer implemented method of mapping a user data schema to a mining model 
schema to obtain a system wherein an opportunity is provided to change a mapping 
between a user data schema and a data mining schema. Gorelik suggests the need for 
an improved automated way to discover mappings in paragraph 0002. Bergamaschi 
suggests in paragraph 2 of page 54 taking structured data sources and semistructured 
data into account for the purposes of capturing and reasoning about semantic aspects 
of schema descriptions of heterogeneous information sources for supporting integration. 

6. Regarding claim 9, Bergamaschi discloses a computer-implemented method for 
mapping a schema to a disparate schema, comprising: 

matching columns of a data schema to corresponding columns of another data 
schema to provide a mapping (page 56, column 1 , paragraph 2); and 

updating a matching resource based on the mapping (Section 1.1, Generation of 
a Common Thesaurus, paragraph 1 ; a Common Thesaurus as disclosed by 
Bergamaschi). 

Bergamaschi does not explicitly disclose determining whether data within 
matching columns of the user data schema has a data type different than data within 
the corresponding columns of the mining model schema; and transforming the data 
within the matching columns of the user data schema if the data type is determined to 
be different; and also does not explicitly disclose wherein if the data types of matching 
columns are different data from the first column is transformed. 
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In the same field of endeavor (integration of disparate data) Squire discloses 
transforming data from a user data environment to a data warehouse environment 
(page 447, column 1, paragraph 2). Determining whether a transformation needs to be 
done before performing a transformation was well known in the art at the time the 
invention was made. 

Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Squire's teachings of 
transformation of data in a user data schema to data in a data warehouse schema with 
Bergamaschi's method for matching columns of a data schema to corresponding 
columns of another schema and updating a matching resource to obtain the claimed 
invention. Squire suggests in the last paragraph of column 1 , page 447, that automated 
tools can reduce time and cost of implementation of a data warehousing system, and 
further suggests in the same paragraph a combination with automated tools that 
"actively capture and maintain meta data related to source files... <and> 
transformations and mappings". Bergamaschi suggests a need in paragraph 1 of page 
54 to make integration as semi-automated and scalable as possible. 

Bergamaschi and Squire do not explicitly disclose providing an opportunity to 
manually alter the mapping after transforming the data; and presenting a final view of 
the mapping after providing the opportunity, wherein the updating step is performed 
after the final view is presented. 

In the same field of endeavor (semantic discovery and mapping between data 
sources), Gorelik discloses providing an opportunity to manually alter the mapping after 
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transforming the data, and presenting a final view of the mapping after providing the 
opportunity, wherein the updating step is performed after the view is presented (Figure 
7; paragraphs 0530 and 0531; see also paragraph 0538). 

Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Gorelik's teachings of providing 
an opportunity to manually alter the mapping after transforming the data, presenting a 
final view and updating, with Bergamaschi's and Squire's suggested teachings of a 
computer implemented method of mapping a user data schema to a mining model 
schema to obtain a system wherein an opportunity is provided to change a mapping 
between a user data schema and a data mining schema. Gorelik suggests the need for 
an improved automated way to discover mappings in paragraph 0002. Bergamaschi 
states in section 1.1, Generation of a Common Thesaurus, paragraph 2, that the 
process of discovering mappings is a semi-automatic process wherein a designer is 
aided by automated tools. 

Bergamaschi and Squire do not explicitly disclose populating a schema 
consolidation table with names of the columns of the mining model schema, prior to the 
matching step; and updating the schema consolidation table with names of the 
matching columns of the user data schema, during the updating step. 

In the same field of endeavor (semantic discovery and mapping between data 
sources), Gorelik teaches the use of a match table (the equivalent of a schema 
consolidation table) indicating mappings between columns (paragraphs 0091 and 
0095). 
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Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Gorelik's teachings of a schema 
consolidation table, with Bergamaschi's and Squire's suggested teachings of a 
computer implemented method of mapping a user data schema to a mining model 
schema to obtain a system wherein an opportunity is provided to change a mapping 
between a user data schema and a data mining schema. Gorelik suggests the need for 
an improved automated way to discover mappings in paragraph 0002. Bergamaschi 
suggests in paragraph 2 of page 54 taking structured data sources and semistructured 
data into account for the purposes of capturing and reasoning about semantic aspects 
of schema descriptions of heterogeneous information sources for supporting integration. 

Regarding claim 10, wherein the matching step comprises determining whether names 
of the columns of the user data schema exactly match names of the columns of the 
mining model data schema (Subsection Automated extraction of relationships, 
paragraph 2; see also Figure 4; Bergamaschi discloses finding synonyms, and shows in 
figure 4, where a synonym is inclusive of an exact match of a column name: "ID. Patient 
SYN CD.Patient"). 

Regarding claim 1 1 , wherein the matching step further comprises determining whether 
the names of the columns of the user data schema are similar to the names of the 
columns of the mining model data schema based on the matching resource (Subsection 
Automated extraction of relationships, paragraph 2). 
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Regarding claim 12, wherein the matching step further comprises determining whether 
the names of the columns of the user data schema are similar to the names of the 
columns of the mining model data schema based on the matching resource (Subsection 
Automated extraction of relationships, paragraph 2). 

Regarding claim 13, Bergamaschi and Squire do not explicitly disclose wherein the 
matching step further comprises determining whether the data within the columns of the 
user data schema corresponds to the data within the columns of the mining model data 
schema. 

In the same field of endeavor (semantic discovery and mapping between data 
sources), Gorelik discloses wherein the matching step further comprises determining 
whether the data within the columns of the user data schema corresponds to the data 
within the columns of the mining model data schema (paragraph 0341; see also 
paragraphs 0346-0349). 

Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Gorelik's teachings of 
determining whether the data within the columns of the user data schema corresponds 
to the data within the columns of the mining model data schema, with Bergamaschi's 
and Squire's suggested teachings of a computer implemented method of mapping a 
user data schema to a mining model schema to obtain a system wherein an opportunity 
is provided to change a mapping between a user data schema and a data mining 



Application/Control Number: 10/706,546 Page 13 

Art Unit: 2167 

schema. Gorelik suggests the need for an improved automated way to discover 
mappings in paragraph 0002. Bergamaschi suggests in paragraph 1 of page 54 a need 
to determine if disparate sources contain semantically related information. 

Regarding claim 14, wherein the matching resource is selected from the group 
consisting of a thesaurus, a dictionary and a similarity threshold (Section 1.1 , 
Generation of a Common Thesaurus, paragraph 1). 

Regarding claim 15, Bergamaschi does not explicitly disclose populating a schema 
consolidation table with names of the columns of the mining model schema, prior to the 
matching step; and updating the schema consolidation table with names of the 
matching columns of the user data schema, during the updating step. 

In the same field of endeavor (semantic discovery and mapping between data 
sources), Gorelik teaches the use of a match table (the equivalent of a schema 
consolidation table) indicating mappings between columns (paragraphs 0091 and 
0095). 

Accordingly, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to have incorporated Gorelik's teachings of a schema 
consolidation table, with Bergamaschi's and Squire's suggested teachings of a 
computer implemented method of mapping a user data schema to a mining model 
schema to obtain a system wherein an opportunity is provided to change a mapping 
between a user data schema and a data mining schema. Gorelik suggests the need for 
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an improved automated way to discover mappings in paragraph 0002. Bergamaschi 
suggests in paragraph 2 of page 54 taking structured data sources and semi-structured 
data into account for the purposes of capturing and reasoning about semantic aspects 
of schema descriptions of heterogeneous information sources for supporting integration. 

7. Claims 16-23 are essentially the same as claims 1-8 except that they set forth 
the claimed invention as a system rather than a method and are rejected for the same 
reason as applied hereinabove. 

8. Claims 24-31 are essentially the same as claims 1-8 except that they set forth 
the claimed invention as a product stored on a computer readable medium rather than a 
method and are rejected for the same reason as applied hereinabove. 

Contact Information 

9. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Tarik C. Koc whose telephone number is 571-272-6725. 
The examiner can normally be reached on 8:30-5:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, John Breene can be reached on 571-272-4107. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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